
*************************************************************************
* Author:          SA, NB, JU, AM, DT, RK, SV
* Project name:    REDI-SPI-ISEP
* File name:       ReadMe.txt
* Institution:     CEEW, SPI, JHU SAIS, Harvard, Columbia, Houston, SPI
* Project purpose: Rural electricity demand and perspectives toward mini-grid and grid electricity
* File purpose:    ReadMe for REDI-SPI-ISEP datasets
*************************************************************************

Table of contents:

1. Introduction
2. Explanatory Notes

*************************************************************************
*	1. Introduction
*************************************************************************

This file is the ReadMe.txt for the REDI-SPI-ISEP dataset. The zip file contains the following files, with the suffix "ENT" for files associated with the enterprise dataset and the suffix "HH" for files associated with the household dataset:

- ReadMe.txt: this file.

- DescriptionVariables.csv: a csv file that contains descriptions and key identifiers for variables.

- SummaryStatistics.pdf: a pdf file that contains summary statistics of variables

- Questionnaire.pdf: a pdf that contains the questionnaire for collected data.

- RawData.dta: a Stata (version 12) dataset containing the raw survey data.

- RawData.csv: RawData.dta as a csv file.

- CleanData.dta: a Stata (version 12) dataset containing clean data and additional data describing processed appliance wattage and estimated electricity consumption.

- CleanData.csv: CleanData.dta as a csv file.

- Report.pdf: a copy of the the related publication. 

- SamplingStrategy.pdf: a pdf that describes the sampling strategy used to select villages, households, and enterprises

*************************************************************************
*	2. Explanatory Notes
*************************************************************************

1. The differences between the RawData.dta and CleanData.dta files are as follows:

 	- In the RawData.dta files, the variables q501_appl_used1 - q501_appl_used9 were miscoded. Letters denoting responses after "c", LED light, were incremented by one letter (i.e., "d" was entered as "e", etc.). This has been corrected in the file CleanData.dta.
    
    - The CleanData.dta file also supplements the RawData.dta file with additional data describing processed appliance wattage (Watts) and estimated electricity consumption (kWh/month). Where available, lower and upper bounds for process appliance wattage are provided, and mean values are provided where observations are missing.

    - The summary statistics in SummaryStatistics.pdf are calculated using RawData.dta


2. Due to limitations in the length of variable names in Stata, some variable names in the dta files differ from those in the variable in the SummaryStatistics and Description files.

In the enterprise dataset:

    - "q335_c_mgrid_reason_no_use_other" is denoted as "q335_c_mgrid_reason_no_use_othr"
    
    - "q419_light_othersources_expenses" is denoted as "q419_light_othersources_expenss"
    
    - "q421_a_light_unsatisfied_reason_all" is denoted as "q421_a_light_unsatisfied_rsn_ll"
    
    - "q421_a_light_unsatisfied_reason1" is denoted as "q421_a_light_unsatisfied_reasn1"
    
    - "q421_a_light_unsatisfied_reason2" is denoted as "q421_a_light_unsatisfied_reasn2"
    
    - "q421_a_light_unsatisfied_reason3" is denoted as "q421_a_light_unsatisfied_reasn3"
    
    - "q421_a_light_unsatisfied_reason4" is denoted as "q421_a_light_unsatisfied_reasn4"
    
    - "q421_a_light_unsatisfied_reason5" is denoted as  "q421_a_light_unsatisfied_reasn5"
    
    - "q421_a_light_unsatisfied_reason_other" is denoted as "q421_a_light_unsatisfid_rsn_thr"
    
    - "q609_a_mgrid_notwanted_reason_others" is denoted as "q609_a_mgrid_notwanted_rsn_thrs"


In the household dataset:

    - "q223_a_cow_buffalo" is denoted as "q223_a_cow.buffalo"

    - "q419_light_othersources_expenss" is denoted as "q419_light_othersources_expenses"

    - "q421_a_light_unsatisfied_reasn1" is denoted as  "q421_a_light_unsatisfied_reason1"

    - "q421_a_light_unsatisfied_reasn2" is denoted as "q421_a_light_unsatisfied_reason2"

    - "q421_a_light_unsatisfied_reasn3" is denoted as  "q421_a_light_unsatisfied_reason3"

    - "q421_a_light_unsatisfied_reasn4" is denoted as "q421_a_light_unsatisfied_reason4"

    - "q421_a_light_unsatisfied_reasn5" is denoted as  "q421_a_light_unsatisfied_reason5"

    - "q421_a_light_unsatisfied_reasn6" is denoted as "q421_a_light_unsatisfied_reason6"

    - "q421_a_light_unsatisfied_reasn7" is denoted as  "q421_a_light_unsatisfied_reason7"

    - "q421_a_light_unsatisfied_rsnthr" is denoted as "q421_a_light_unsatisfied_reasonother"

    - "q609_a_mgrid_notwanted_resnthrs" is denoted as "q609_a_mgrid_notwanted_reasonothers"


3. In addition to the 200 villages from which households/enterprises were sampled in the original design (SamplingStrategy.pdf), the dataset includes additional observations from four villages: Nigohi, Lalia, Mathura, and Chaudhera. Observations from these villages are excluded from REDI-SPI-ISEP report to ensure balance.

